Combining Per-frame and Per-track Cues for Multi-person Action Recognition
نویسندگان
چکیده
We propose a model to combine per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual’s action in a scene and the flow of actions of an individual in a video sequence, inferring valid tracks in the process. Our motivation is based on the unlikely discordance of an action in a structured scene, both at the track level and the frame level (e.g ., a person dancing in a crowd of joggers). While we can utilize sampling approaches for inference in our model, we instead devise a global inference algorithm by decomposing the problem and solving the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on the stateof-the-art action recognition results for two publicly available datasets.
منابع مشابه
Leveraging Structure in Activity Recognition: Context and Spatiotemporal Dynamics
Title of dissertation: LEVERAGING STRUCTURE IN ACTIVITY RECOGNITION: CONTEXT AND SPATIOTEMPORAL DYNAMICS Sameh Khamis, Doctor of Philosophy, 2015 Dissertation directed by: Larry S. Davis Department of Computer Science Activity recognition is one of the fundamental problems of computer vision. An activity recognition system aims to identify the actions of humans from an image or a video. This pr...
متن کاملRagdolls in Action – Action Recognition by 3d Pose Recovery from Monocular Video
We present a novel approach to reconstruct and track articulated objects, specifically humans, in 3D from monocular videos for action recognition, by combining techniques from both image processing and 3D computer animation. The goal is to establish a system that is able to recognize basic actions (like walk, run) from frame to frame in a scene with more than one person. In a first step a featu...
متن کاملAction Change Detection in Video Based on HOG
Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...
متن کاملContinuous Action Recognition by Action-specific Motion Models
This paper proposes the models of human motion prior with multiple actions for action recognition in videos. A training sequence of each action, such as walking and jogging, is separately recorded by a motion capture system and modeled independently. Unlike existing approaches with similar motion prior models, our method uses the multiple models simultaneously for particle filtering in order to...
متن کاملEar Recognition from One Sample Per Person
Biometrics has the advantages of efficiency and convenience in identity authentication. As one of the most promising biometric-based methods, ear recognition has received broad attention and research. Previous studies have achieved remarkable performance with multiple samples per person (MSPP) in the gallery. However, most conventional methods are insufficient when there is only one sample per ...
متن کامل